Voice conversion algorithm based on piecewise linear conversion rules of formant frequency and spectrum tilt

Authors

  • Hideyuki Mizuno
  • Masanobu Abe
Abstract

This article presents a new algorithm that converts the speech of one speaker so that it sounds like that of another speaker. The algorithm converts voice quality flexibly through two major technical developments. First, formant frequencies and spectral intensity are modified using piecewise linear voice conversion rules, which enables detailed control of the spectrum parameters. The conversion rules are generated automatically for any pair of speakers. The reliability of the conversion rules is guaranteed because they are statistically generated using training data. Second, the algorithm can produce speech with a desired formant structure by controlling formant frequencies, formant bandwidths and spectral intensity; the speech is modified iteratively until the specified formant structure is achieved. Listening tests show that the proposed algorithm converts speaker individuality while maintaining high speech quality.
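
The idea of a piecewise linear conversion rule for a formant frequency can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not say how the rules are parameterised or trained, so the function name, the knot representation, the clamping behaviour outside the knot range and the breakpoint values below are all hypothetical, chosen only to show how a source-speaker formant frequency could be mapped to a target-speaker value segment by segment.

```python
from bisect import bisect_right


def piecewise_linear_map(x, src_knots, tgt_knots):
    """Map x through the piecewise linear function defined by knot pairs.

    src_knots must be strictly increasing; tgt_knots gives the value of
    the function at each source knot. Inputs outside the knot range are
    clamped to the first or last target value (an assumption of this
    sketch, not something stated in the abstract).
    """
    if len(src_knots) != len(tgt_knots) or len(src_knots) < 2:
        raise ValueError("need at least two knots, with matching lengths")
    if x <= src_knots[0]:
        return tgt_knots[0]
    if x >= src_knots[-1]:
        return tgt_knots[-1]
    # Index of the segment [src_knots[i], src_knots[i + 1]) containing x.
    i = bisect_right(src_knots, x) - 1
    x0, x1 = src_knots[i], src_knots[i + 1]
    y0, y1 = tgt_knots[i], tgt_knots[i + 1]
    # Linear interpolation inside the segment.
    return y0 + (y1 - y0) * (x - x0) / (x1 - x0)


# Hypothetical breakpoints for a first-formant rule, in Hz. In a real
# system these would be estimated from training data for a speaker pair.
SRC_F1_KNOTS = [200.0, 500.0, 900.0]
TGT_F1_KNOTS = [250.0, 600.0, 1000.0]

converted_f1 = piecewise_linear_map(350.0, SRC_F1_KNOTS, TGT_F1_KNOTS)
print(converted_f1)
```

With more knots the rule can follow a more irregular relationship between the two speakers' formants, which is presumably what gives piecewise linear rules their finer control compared with a single linear scaling factor.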

Similar resources

A Novel Efficient Algorithm for Voice Gender Conversion

Realistic Voice Gender Conversion (VGC) requires independent scaling of the glottal (pitch) and vocal tract (formant) related features of the input speech signal. We present a VGC algorithm which has two novel features. Firstly, an efficient frequency scaling algorithm is presented. Secondly, we use this to scale all frequencies in the input signal by the desired formant scaling factor. We then...

Using Context-based Statistical Models to Promote the Quality of Voice Conversion Systems

This article examines methods for optimizing the performance of GMM-based voice conversion systems, taking the GMM method as the baseline to be improved. In current methods, because a single conversion function is used to convert all speech units and statistical averaging then causes spectral smoothing, we will observe quality ...

Probability models of formant parameters for voice conversion

This paper explores the estimation and mapping of probability models of formant parameter vectors for voice conversion. The formant parameter vectors consist of the frequency, bandwidth and intensity of resonance at formants. Formant parameters are derived from the coefficients of a linear prediction (LP) model of speech. The formant distributions are modelled with phoneme-dependent two-dimensio...

不需平行語料而基於共振峰與線頻譜頻率映對之語者特質轉換系統 (A Voice Conversion System based on Formant and LSF Mapping without Using Parallel Corpus) [In Chinese]

Voice conversion has been used in many applications. Methods based on vector quantization codebooks and Gaussian mixture models need dynamic time warping on a parallel sentence corpus to generate mapping functions. Recent studies try to use less training data, and even to do without a parallel sentence corpus. This paper presents a voice conversion method that does not use a parallel sentence corpus. It ...

Voice Conversion technology is a new technology

In this paper, we put forward a time-domain female-male voice conversion algorithm. This method mainly focuses on two acoustic features that are thought to be the most important to speech individuality: pitch frequency and formant frequencies. To change pitch frequency, we cut off or add the low amplitude parts of speech signals in one pitch period. To change formants, according to the relation...

Journal:
  • Speech Communication

Volume 16

Publication date: 1995